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On Abstract 

o 

In the last few years the so-called "linear deterministic" model of relay channels has gained 
04 popularity as a means of studying the flow of information over wireless communication networks, 

^ and this approach generalizes the model of wireline networks which is standard in network 

optimization. There is recent work extending the celebrated max-flow/min-cut theorem to the 
capacity of a unicast session over a linear deterministic relay network which is modeled by a 
1^ layered directed graph. This result was first proved by a random coding scheme over large 

blocks of transmitted signals. We demonstrate the same result with a simple, deterministic, 
polynomial-time algorithm which takes as input a single transmitted signal instead of a long 
r ' block of signals. Our capacity-achieving transmission scheme for a two-layer network requires 

the extension of a one-dimensional Rado-Hall transversal theorem on the independent subsets 
^ of rows of a row-partitioned matrix into a two-dimensional variation for block matrices. To 

O generalize our approach to larger networks we use the submodularity of the capacity of a cut 

for our model and show that our complete transmission scheme can be obtained by solving 
a linear program over the intersection of two polymatroids. We prove that our transmission 
^ scheme can achieve the max-flow/min-cut capacity by applying a theorem of Edmonds about 

JT^ such linear programs. We use standard submodular function minimization techniques as part 

of our polynomial-time algorithm to construct our capacity-achieving transmission scheme. 

<N 

^ 1 Introduction 

O 

o> 

Q Network information theory [7, Ch. 15] attempts to model aspects of large communication networks 

such as interference, cooperation, and noise that are often overlooked in network optimization theory. 
Relay channels [7, §15.7] are an example of a network information problem in which there is a source, 
a unique destination, and at least one intermediary transmitter-receiver pair which is instrumental 
to the communication between the source and the destination. In this paper we focus on a simplified 
mathematical model for the wireless relay channel which we describe via a directed graph M{V,S), 
where V denotes the set consisting of the source node, the destination node, and all relay nodes. 
Here the source node only sends signals, the destination node only receives signals, and a relay node 
can both receive signals and transmit any function of its incoming messages. The transmission of 
signals in this network obeys two rules: 1- any signal which is sent by node i is broadcast to every 
node j such that ij & S, and 2- the signal which is received by node j is a linear combination of all 
signals that are broadcast to it and an independent additive noise signal which is typically modeled 
by a Gaussian random variable. Finding the capacity of a wireless relay channel has long been a 
challenging and important open problem. Avestimehr, Diggavi, and Tse [6, 5] recently proposed a 
simplified linear deterministic relay network model in which the effects of broadcasting, interference. 
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and noise are captured by linear transformations of the transmitted signals. One motivation for the 
stud)' of this model is the result in [6, 5] that the capacity of any wireless relay channel with Gaussian 
noise is within a constant additive factor of the capacity of a corresponding linear deterministic 
model. More recent work [11, 19] has connected the linear deterministic model to the approximate 
capacity of other relay channels and to the design of near-optimal coding schemes for them. We 
next summarize the model and some of the results of [6, 5]: 

The authors of [6, 5] focus on layered directed graphs Af with set of nodes V = Oi U ©2 U • • • U 
Om- Here Oi = {01(1)} = {5} and Om = {C'm(I)} = {^} respectively denote the source and 
destination nodes, and Oi = {Oj(l),--- ,Oi(mi)} denotes the set of relay nodes in the zth layer 
for ?■ G {2, • • • , Af — 1}. Each edge in the graph is from some node in Oi,i <E {1, • • • ,M—1}, to a 
node in O^+i. Observe that the study of an arbitrary directed network can also be placed into this 
framework if one instead studies its time-expanded representation [6, 5]. In this case Oi corresponds 
to the possible behaviors of the network from symbol time {i — l)r to symbol time it — 1 for some 
positive integer r. 

In the layered network, every node Oi{j),i G {2, • • • , M — 1} , receives a column vector yj(j) at 
time t and transmits a column vector x*(j) at time t. Node S transmits vector = x* and node V 
receives vector = y^. The elements of all transmitted and received vectors belong to some fixed 
finite field ¥q. Ever}' edge Oi{j)Oi+i{k) G £ represents a communication channel which is described 
by a matrix Gi{j,k) with entries from Fg called the transfer function from x^(fc) to vector yj_^_l{j). 
Define the block matrix Gi by Gj = [Gi{j, fe)] for j G {1, • • • , rrii+i} = [mj+i] and k G [mi] . Let x* 
be the transmitted vector from layer i and y*^^ be the received vector at layer i + i.e.. 



x* = 



x^(mi) 



and y*+i = 



yf+i(i) 



y|+i(mi+i) 



The communication channel from layer i to layer i + 1 is characterized by the following relationship: 



Yi+i 



Gi 



Xi 



Assume the communication session begins at time 1 and ends at time r and has desired rate 
of transmission R; i.e., node S wishes to send message u which is chosen randomly from a set 
{1, • • • , q'^^] of messages to node V at the end of the session. The communication protocol proceeds 
from layer to layer. Node Oiij) transmits vectors x|(j), • • • ,x[(j) to nodes in layer Oj+i, and node 
transmits signals to the nodes in next layer after receiving vectors y}_^_i{k), ■ ■ ■ ,yl_^_i{k). At 
every time instant t, vector x^ is some function of a; and x*(j) is some function of y|(j), • • • ,y[(j). 
There are two natural questions about this model: First, what is the capacity or maximum rate of 
information in this network? Second, among capacity-achieving schemes, how can one optimize the 
duration r and the complexity of the relay functions used? 

The first of these questions was initially addressed in [6, 5]. To study the capacity C of the 
network, we first define a cut as a subset of the nodes V. A cut separates S from D if 5 G $7 and 
V £ 0, = V\r2. The transfer function of the cut 17, is defined as a block diagonal matrix with 

(M — 1) X (M — 1) blocks. The i*'' diagonal block, Gi{il) is the submatrix of Gi consisting of the 
transfer functions from the transmitted vectors of the nodes in ClCiOi to the received vectors of the 
nodes in n Oj+i for i G [M — 1] , and each off-diagonal block is an all-zero matrix. Avestimehr, 
Diggavi and Tse used a min-cut upper bound on the rate of transmissions [7, Thm. 15.10.1] and a 
random coding argument to show 



Theorem 1. For any cut ft, C{fl) = rank{G{il,)) = X]j=i rank{Gi(Q)). Furthermore, the capacity 
of network N as defined above is C = niin^ separates S and 
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The achievability argument in [6, 5] is based on a linear scheme in which source node S ini- 
tially encodes the message uj as a vector in F^^ denoted by ys{^)- Node S and relay node Oi{j) 
respectively generate transmitted signals by the linear transformations 















= Fs ■ y5(w) and 




= F.U) • 








. xKi) . 







It is shown in [6, 5] that if the encoding matrices and Fi{j) are chosen randomly with a uniform 
distribution over the space of all matrices over the field ¥q, if r is sufficiently large, and if R < C, 
then the destination node V will, with probability approaching 1, receive tR linearly independent 
linear combinations of the message vector ysi^) from which it will be able to decode message lo. 

Since the complexity of the transmission scheme in [6, 5] is large and increases with r, we seek 
a deterministic, low-complexity transmission scheme that is capacity-achieving and processes only 
one signal x*(j) at a time for each i and j, i.e., r = 1. We remark that [4] considered similar issues 
for transmissions over a binary field. We will discuss the approach of [4] in Section 1.2. 

1.1 Our Results and Techniques 

Our algorithm has two main steps. First we propose an algorithm to transmit signals from layer Oi, 
i G {!,■ ■ ■ ,M — l},to layer Oj+i in an optimal way. In the second step we extend our algorithm to 
the full network and prove that it is capacity-achieving. Since our transmission scheme manipulates 
only one signal x*(j) at a time for each i and j, we hereafter drop the time superscript. 
We define a flow of the block matrix Gi as follows: 

Definition 2. Let dj = , ^j(mj); • • • ,£j+i(mj+i)) be a vector of non-negative 

integers that satisfies X]j=i-^iO) = Z^j^i^ -^i+iO) = -^dr We say that matrix Gi supports flow dj 
if there exists a full rank R^. x R^. submatrix G^i of Gi such that G^i is an intersection of li{j) 
columns of the jih column block of Gi, j G [rrij] , with 4+1 (fe) rows of the kth row block of Gi, 
k G [mj+i]. (See Figure l.(a).) We further say that such a submatrix Gd, is a solution for flow dj. 

For the physical interpretation of flow, suppose matrix Gi supports flow dj. Consider the sub- 
vector of Xj and the subvector yd^+i of yj+i which correspond to the transfer matrix G^i- 
Furthermore, let Xd.(j) and ydj+i(^) respectively denote the parts of these subvectors that belong 
to vector Xj(j) and yj+i(fc). If the entries of Xj which are not part of Xd^ are set to zero, then yd^+i 
will uniquely determine Xd^ since Gd^ is a full rank matrix. Hence R^^ units of information flow 
from the nodes in Oi to the nodes in Oj+i during a transmission. We next introduce the notion of 
the flow from Oi to Om supported by network M. For convenience we will consider a more general 
network M with an arbitrary number of nodes in the first and last layers as opposed to the single 
node each at the first and last layers of relay channel models. 

Definition 3. Suppose non-negative integers ^i(l),--- ,£i{mi), and £m(1),-'' ^^Mijn-M) satisfy 
Erii^i(i) = E7={^M(i) = R. We say that vector d = (4(1), ••• , 4(mi); £m(1): • • • .4u(^"A/)) 
is a rate-i? flow supported by network M if for every i G {2, • • • , M — 1} there exists non-negative 
integers £i{l),--- ,£i{mi) such that vector dj = ,£j{mj);£j^i{l), ■ ■ ■ , £j_|_i(mj+i)), j G 

[M — 1] , is a rate-i? flow supported by matrix Gj. (See Figure l.(b).) 

Every flow for network M is determined by the submatrices Gd^ and the corresponding row and 
column indices of Gj. Let us return to the case where has a single node each in the first and last 
layers. Suppose that network Af supports a rate-i? flow d = {R; R) . Then given Gd^, i G [M — 1] , 
a simple coding scheme that achieves rate R can be defined as follows: 
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(a) 



Figure 1: (a) An example of a matrix flow for vector dj = (2, 2; 1, 3) in a matrix Gi with four blocks. 
Each small square is an entry of Gi and matrix Gd, is the intersection of dashed rows with dashed 
columns, (b) An example of network flow d = (3; 3) in a network with four layers. The solid part 
of vector yi{j) denotes the entries of ydi(j) and the dashed part of vector Xj(j) denotes the entries 
of Xd,(j)- The flow vectors are di = (3; 1,2), da = (1,2; 2,1), and ds = (2,1; 3). 



Transmission Scheme: 

Given the length-i? encoded vector ys{u;), node S generates vector = xi by setting xj^ to the 
vector ysi^) and by setting the other entries of x^ to zero. The transformation at every relay 
node Oi{j) is similar: after receiving vector yi(j), node Oi{j) extracts the subvector ydj(j) with 
length £i{j) and sets Xd.(j) = ydi(i)- The remaining entries of Xj(j) are set to zero. Finally node 
V first decodes subvector y^,^., from the received vector yj) = yM and then extracts the encoded 
message ys{uj). Observe that for every i, x^^ = G^^ ■ ya^+i and xj- = yd,- These imply that 
ys{^^) = G^^G'^^ ■ ■ ■ G^^^ ^YdM- Since the matrices Gd, are nonsingular, the decoding operation is 
well defined. 

Our main technical result in Section 2 is the following theorem providing the necessary and 
sufficient conditions for matrix Gi to support a flow d^: 

Theorem 4. For any subsets U C [nii] and W C [mj+i] define the block matrix Gi{W, U) as the 
intersection of the row blocks of Gi having indices in W with the column blocks of Gi having indices 
in U. Matrix Gi supports a flow dj if and only if for all U C [mi] and for all W C [mj+i] 

rank{G,{W, U)) > Y^^iU) + Yl " ^d,- (2) 

This combinatorial property of matrices is, to our knowledge, the first two-dimensional result 
of this type and may be of independent interest in the theory of matrices. Theorem 4 holds for 
matrices with entries from an arbitrary field and is therefore more general than its application for 
this relay problem. We prove the necessity of Theorem 4 by examining the relationships of the 
ranks of various submatrices of block matrix Gd ■ 

Our sufficiency argument is more technical and involves a divide-and-conquer procedure and 
an inductive argument to prove the existence of G^^ ■ The basis of our inductive argument will be 
the Rado-Hall transversal theorem [25, Ch. 7], which states the necessary and sufficient conditions 
for the existence of independent structures in a collection of subsets of elements of a matroid. In 
the special case of matrices we have: 
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Theorem 5. (The Rado-Hall Theorem) Let H be a matrix with column blocks labeled I,-- - ,m. 
For U C [m] let HiJJ) denote the submatrix of H which is formed by the column blocks with indices 
in U. Given non-negative integers £i, - • • ,irn> there are ii + • • • + im linearly independent columns 
of H with exactly ii of the columns from H{{i}), i G [m] , if and only if for every U we have 
rank{H{U))>j:,^uei. 

In the study of matrices Theorem 4 can be viewed as an extension of the Rado-Hall Theorem. 
The Rado-Hall Theorem has man)' variations for matroids with different constraints on the car- 
dinalities of the independent sets [18]. However, the rank function of a matroid is, by definition, 
both submodular and nondecreasing. The columns of a matrix form a ground set for a matroid, 
and the rank function of a subset of columns is defined as the dimension of the subspace spanned 
b}^ these columns; this is also true if the columns arc replaced by the rows of the matrix. We next 
define a natural extension of the rank function on the union of all rows and columns of a matrix 
which is motivated by the cuts of a network. Let P and Q respectively denote the set of rows and 
columns of the matrix. Define the rank function of a subset T Q V U Q where T = P U Q with 
pop and Q C Q, to be the dimension of the spanning subspace of the submatrix which is formed 
by the intersection of the rows in P and the columns which arc not in Q. In Lemma 18 we study 
this rank function for the block matrices of network M and prove that it is submodular. In that 
discussion the nodes of the two consecutive layers of the network respectively represent the column 
and row blocks of a matrix. We comment that the rank function arising from the study of cuts 
in our model has an important difference from the rank function in earlier transversal theorems. 
Observe that since the rank of both sets and U Q is zero, the rank function we introduce is 
not monotone. Korte and Lovasz [15] initiated the study of a generalization of matroids known as 
grccdoids with rank functions which are monotone but not necessarily submodular. They have also 
derived a transversal theory of greedoids similar to the Rado-Hall Theorem [15]. Theorem 4 appears 
to be the first extension of the Rado-Hall Theorem in which the rank function is submodular but 
not monotone. This is interesting because monotonicity plays a central role in the proofs of the 
previous results. 

In Section 3, we prove the following extension of Theorem 4: 

Theorem 6. A network M with capacity C supports flow d = (i?; R) if and only if R <C. 

Edmonds and Giles [10] introduced a generahzation of classical network fiow known as sub- 
modular flow, where the classical fiow conservation constraints are replaced by submodular fiow 
constraints on certain subsets of nodes. We prove a submodularity property of the cut function 
in Lemma 19, and it is possible to show that our notion of flow for network is a special case 
of submodular flow. The Edmonds and Giles theory of submodular flow docs not directly imply 
variations of the max-fiow min-cut theorems. We therefore study our fiow using an earlier result of 
Edmonds [9]. We prove in Section 3 that the maximum fiow in our setting is a linear programming 
optimization over the intersection of two polymatroids. We apply a corollary of the polymatroid 
intersection theorem [9] to show that the maximum rate of a flow is the capacity' C of the network 
and that the corresponding fiow can be achieved by non-negative integer vectors dj. 

In the appendix, we demonstrate an algorithm to construct a capacity-achieving code for network 
J\f which is strongly polynomial time in the size of the graph and in the size of the matrices Gj. In 
first step of the analysis we show that: 

Theorem 7. Given matrix Gi and an achievable flow vector di, the submatrix can be computed 

in polynomial time. 

We can in principle use the divide-and-conquer argument for the proof of Theorem 4 to obtain a 
recursive algorithm for this problem, but since the analysis is difficult we take a different approach. 
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We will demonstrate that testing the conditions of Theorem 4 for matrix Gj and flow dj is equivalent 
to showing that a certain submodular function has a non-negative minimum. It is well known (see, 
e.g., [23, 14]) that there are polynomial-time algorithms to find the minimum of a submodular 
function. Our algorithm to construct checks which rows of Gi can be removed without violating 
the conditions of Theorem 4 and then removes them one by one. The same procedure is next repeated 
for the columns. The final part of this step is to establish that the remaining matrix is a valid choice 
for Gdr 

The second step establishes the following result. 

Theorem 8. The capacity C of a deterministic relay network M can be computed in polynomial 
time. Given the network flow vector d = {R; R) with R < C, the flow vectors dj for every matrix 
Gi, i E {!,■ ■ ■ , M — 1} , can be computed in polynomial time. 

We prove this theorem in the appendix by applying the algorithm in [17] for optimizing a linear 
function over the intersection of two potymatroids. Theorems 7 and 8 together imply a polynomial- 
time algorithm for finding a transmission scheme for network J\f. 

1.2 Related Work 

Earlier work [3], [22] obtained capacity results for a different type of deterministic relay network 
in which the nodes broadcast data but the signals are received without interference. The paper [4] 

considers the same problem we address here, but restricts Fy to a binary field. The approach of 
[4] is based on a path augmentation argument similar to the Ford-Fulkerson algorithm (see, e.g., 
[12]) and involves a new network in which every node Oi{j) is replaced by several nodes with each 
corresponding to a different entry of vector Xj(j) or yi{j). In the new network there is an edge 
between a pair of nodes if the analogous entr)' in the transfer function Gi{k,j) is equal to one. 
For an edge e, we respectively denote its tail and head by .x(e) and y(e). Edges ei,--- ,ex are 
said to be independent if the transfer function from the vector (x(ei),-- - ,x{eK)) to the vector 
(?/(ei), ■ • • ,y{eK)) has full rank. The scheme in [4] finds C disjoint paths in the new network, where 
every cut that separates S from V intersects C independent edges of these paths. There does not 
appear to be a natural way to extend the approach of [4] to arbitrary finite fields. We use a different 
procedure to construct the full rank submatrices needed for our transmission scheme. 

Remark. The missing proofs in the body of the paper can be found in the appendix. 

2 Proof of Theorem 4 

At several steps of our proof we remove rows or columns from matrix Gi to find a suitable submatrix 
Gdi- Unless otherwise stated, assume that each such intermediate submatrix T of Gi maintains the 
same partition of row blocks and column blocks as original matrix Gj. In other words, each row 
(column) of T consists of a subset of the entries of some row (column) of Gi, and the two rows 
(columns) have the same row (column) block index in their respective matrices. For U C [rrii] and 
W C [mj+i], let the block submatrix T{W, U) denote the intersection of the row blocks of T having 
indices in W with the column blocks of T having indices in U. Notice that some row and/or column 
blocks of T may be empty. 

2,1 Proof of Necessity 

For U C [nii] and W C [rm+i], Gd,(VF, U) is a submatrix of Gi{W, U). Therefore, x<mk{Gi{W, U)) > 
rank(Gdj(W, U)). From the submodularity of the rank function we have: 
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rank(Gd,(K+i] , U)) < rank(Gd,(W, U)) + rank(Gd,(K+i] \W, U)). (3) 
Since is a full rank square matrix all of its columns are independent and hence 

rank(Gd,([m,+i],C/)) = ^4(j)- (4) 

jeu 

By the monotonicity of the rank function 

rank(Gd,(K+i] \W, U)) < rank(Gd,(K+i] \W, [rm])). (5) 
Since all of the rows of Gd^ are independent we have 

rank(Gd,([mi+i] [mi])) = ^ = i?d, - ^ li+i{k). (6) 

k&[mi+i\\W keW 

The relations (3)-(6) imply the necessity of the condition 

vankiGiiW, U)) > ^ 40') + ^ £i+i{k) - Rd,- 
jeu kew 

2.2 Proof of Sufficiency 

Assume throughout this subsection that the conditions of Theorem 4 are satisfied. First suppose 
that mj+i = 1 and W = {1} .Then for every set U C [rrii] the inequality (2) reduces to: 

vank{Gi{{l} ,U)) > ^ + ^i+i(l) - i^d,- 
j&U 

Since the definition of vector implies £i+i{l) = Rd,, it follows that rank(Gj({l} , U)) > J2jeU ^iU)- 
By Theorem 5, there exists a rank-iidj submatrix Gj of matrix Gj which consists a subset of £i{j) 
columns, for j G [mi], from each column block Gj({l},{j}). Since rank(Gi) = R^^, Gi has a 
submatrix Gd, consisting of Rdi independent rows. Gd^ is a solution for flow dj. 

We can similarly argue the existence of a solution Gd, for fiow dj when = 1. Next suppose 
mi > 2 and mj+i > 2. We will use induction and a divide-and-conquer procedure to prove the 
sufficiency result. For our inductive hypothesis we assume that Theorem 4 is true for any block 
matrix Gi consisting of rij+i x rii blocks where Ui < mi, nj+i < mj+i, and mj+i x mi ^ n^+i x n^. 

Let U C [m^i] and W C [mj+i]. We say that Gi{W, U) is a tight submatrix of Gj if the inequality 
(2) holds with equality for U and W. 

Lemma 9. Either Gi has a tight submatrix or Gi has a submatrix T for which for all U C [mi] and 
W C [mi+i], 

rank{T{W, U)) > ^ ^j) + ^ £,+i{k) - Rd, (7) 
jeu kew 

and T has a tight submatrix; i.e., (7) hold with equality for some U C [mi] and W C [mj+i] . In the 
latter case we replace Gi with T for the rest of the proof of Theorem 4- 

By our previous argument, Gi has one or more tight submatrices. We call a tight submatrix 
Gi{W, U) a proper submatrix if it is not one of the following cases: 
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1. \U\ = rrii and \W\ = 1, or 

2. \U\ = 1 and \W\ = rui+i. 

For the rest of the proof of Theorem 4 we need a proper submatrix. We have the following result: 

Lemma 10. Either (a) Gi has a proper submatrix or (h) it has a submatrix T which satisfies (7) 
for all U C [nii] and W C [mj+i] and T has a proper submatrix T{W, U). If case (b) applies then 
we replace Gi with the corresponding submatrix T for the rest of the proof of Theorem 4- 

Let P = Gi{W, U) be a proper submatrix of Gi. Next we reorder and relabel the row blocks and 
the column blocks of Gi such that P = Gi{[ni+i] , [rii]). It is straightforward to reverse the ordering 
and relabeling operation at the end of our argument. We label the four parts of matrix Gi as the 
following, where A,B, and/or L may possibly be empty matrices: 



Gi 



P A 
B L 



We denote by Ga the matrix [ P ^ ] • We further consider a partition of Ga into blocks as the 
following: 





G^{l,[m]) 


Gi{l,ni + l) 




Gi{l,mi) 


Ga = 












_ Gi{ni+i, [rii]) 


Gi{ni+i,ni + 1) 




Gi{ni+i,mi) 



Next consider the vector 

dA = (rank(P),^j(nj + l),ii{ni + 2), • • • ,ii{mi);£i+i{l), ■ ■ ■ , ii+iirii+i)) 
We verify that is a valid flow vector. By the tightness of matrix P we have: 



rank(P) = ^£,+i(j) 



Therefore we have 



E 

k=l 



rrii 



rii+i 



rrii 



j=l k=ni+l 



'li+i 



rank(P) 

Lemma 11. Matrix Ga supports flow d^. 



j=ni+l 



,{j) = Y,^i+iik) = RA. 



(8) 



k=l 



Let — [ At ] he the submatrix of Ga corresponding to a solution for flow d^; here 
Pt is a submatrix of P and At is a submatrix of A. We let G'^^ = [ Pc At ] , where Pc is the 
submatrix of P consisting of the rows of P that are used for the construction of Pf, Pt is then a 
submatrix of Pc consisting of a subset of its columns. Notice that matrices Pt and Pc have a natural 
partition into n^+i x rii block matrices which is induced by the block structure of matrix P. Next 



let Gb 



B 



and partition Gb into blocks as follows: 





r Pe(K+i],i) 




Pc([nj+i] ,ni) 


Gb = 


Gi{ni+i + 1,1) 




Gi{ni+i + l,ni) 










Gi{mi+i,l) 




Gi{mi+i,ni) 
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Next we define the vector 



ds = (4(l),--- , 4 (nj);rank(P), 4+1 (nj+i + !),••• , £i+i{mi+i)) . 



Since 

ni+i rii rii "ij+i 

rank(P) = ^£,+i(j) + X]^^W-^d. = X]^iW- E ^^+1^' 

j=l k=\ k=l j=ni+i+l 

it follows that 

rii m-i+i 

Y,Uk)= rank(P) + ^ 4+i (j) = Rb 

k=l j=ni+i+l 

and is a valid flow vector. Furthermore we have: 
Lemma 12. Matrix Gb supports flow ds- 



Let Gdg 



Pet 

Bt 



be the submatrix of Gb corrsponding to a solution for flow d^; here Pet is a 

p 



cr 

Bt 



, where Per is the submatrix of 



submatrix of Pc and Bt is a submatrix of B. We let G^^ = 

Pc consisting of the columns of Pc that are used for the construction of Pct\ Pet is then a submatrix of 
Per consisting of a subset of its rows. Finally, let Lt be the submatrix of L obtained by intersecting 
the set of columns with indices matching those of the columns of At with the set of rows with indices 
matching those of Bt. Observe that 



Per -^t 

Bt Lt 



is a submatrix of Gj. Our final step is the following lemma: 

Lemma 13. Matrix G^^ as defined above is a solution for flow dj for matrix Gi. 

To summarize the preceding argument, we have established the existence of a solution Gj. when 
rrii = 1, when mj+i = 1, and when mj > 2 and mj+i > 2. Our proof of Theorem 4 is complete. 



3 Proof of Theorem 6 

We prove a more general statement. Consider a network AA with an arbitrary number of nodes in 
the first and last layers. For a rate-i? flow vector d = (-^1(1), • • • •^i(mi); ^m(I), • • • ,-^m('tT'm)) we 
show that: 

Theorem 14. Network J\f supports rate-R flow d if and only if for every cut Q, 

Cin)> J2 ^1 (■?■)+ E ^M{k)-R. (9) 
Oi(j)en OM{k)en 

Notice that for M = 2, Theorem 14 reduces to Theorem 4. Also if M has a single node each 
in the first and last la3'ers, then ^i(l) = Cm(1) = P, and hence (9) and Theorem 1 imply that for 
every cut fl with 5 G and V E Cl, C{^) > C > R. Thus, Theorem 14 implies Theorem 6. 
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3.1 Proof of Theorem 14 



We use induction on M. For M = 2, Theorem 4 gives the result. For M > 2, choose K G 
{2, • • • , M — 1} . Define networks Ma and Mb to respectively be the subnetworks of with node 
set Oi U • • ■ U Ok and Ok U ■ • ■ U Om- The next step of our argument is to show that the induc- 
tive hypothesis and (9) imply the existence of non-negative integers ^/^-(l), • • • ,iK{fnK) such that 
Yli^ ^K{i) = R and networks Ma and Mb support the rate-i? flows 

dA = (^i(l),-- - ,ii{mi);iK{l),--- ,iK{niK)) 
ds = (^x(l),--- jKimK);iM{l),--- ,hi{mM))- 

This step would estabhsh that M supports flow d since submatrix can be obtained from the 
solution to Ma for z G {1, • • • ,K — 1} and from the solution to Mb for i G {K, • • • , M — 1} . 

By the inductive hypothesis if the desired ^a-(I), • • • ,iK{mK) exist then they are non-negative 
integers which form a feasible solution to the following system of Hnear constraints: 

(C{nA) > EoiO-)en^ ^i(j) + T,OK{k)enA ^^(^) ~ ^' ^'^^ ^^^^^ ™ 
I C{nB) > Y.OK(j)&^B ^K^j) + EoM(fe)6f^s ^Af(fc) - -R, for every cut Q.b in Mb, and (10) 



For any set T C Ok define: 



/^(r) = min ^ C(J1a) - ^ 4(j) + ^:C»xnnA = T[ (11) 



fB{T) = min < 



c{nB) - J2 ^Af(i) + R:OKr]nB = T 

OM{j)enB 



} ■ (12) 



Then the system (10) of linear constraints is equivalent to 



'Eo^O)er^i^(i) < fA{T), for every TCOk 

< Eoj,U)^T^K{j) < fB{T), for every TCOk (13) 

.j:T=i^K{i) = R. 

Lemma 15. The functions fA{T) and fB{T) are I) submodular, II) nondecreasing, and satisfy III) 
/a(0) = and fB{^) = 0. Notice that function f is submodular if for every Ti and T2: 

f{Ti) + /(Ts) > /(Ti n T2) + /(Ti U T2) 

and is nondecreasing if for every Ti C T2, /(Ti) < /(T2). 

Referring to terminology in polyhedral optimizations (see [24, §5.15]) a polytope P is integer 
if and only if each vertex of P has integral coordinates. If a polyhedron P = {x : Ax < b} in 
n dimensions is integer, then any linear programming problem max {c'^x : A'x. < b} with a finite 
solution must have a solution with integral coordinates. 

Let / be a submodular function on some set V with v elements. The polymatroid associated 
with / is: 

= {x G : X > 0, x{U) < f{U) for every U ^V} , 
where we define x = [ x(l) • • • x{v) and x{U) = Z^^eC/^l^)- 
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Theorem 16. ([9J) Let fi and f2 be nondecreasing suhmodular set functions with integer values on 
V with /i(0) = /2(0) = 0. Then Pf, D Pf^ is integer and for each U QV, 

max{x(C/) : X G P/, n P/J = min (/i(r) + f2{U\T)) . (14) 

For the submodular set functions /a and fs, define the polymatroids: 

= {x G M""^ : X > 0, x{T) < fA{T) for every T C Or} , 
= {x G M'"^ : X > 0, x{T) < feiT) for every T C Or} ■ 

For (13) to have a non-negative and integral solution, max{x {Ok) '■ x G P/^ n P/^} > P is clearly 
necessary. To show sufficiency suppose y G P/^ fl Pfg achieves maxja; {Ok) '■ x G P/^ PI Pfg} > R- 

Then for every choice of < (-kU) < y{j)-, [ ^k{^) ■ ■ ■ ^K{mK) ] G ^fA^^fs so we choose 
■^K{j) such that X^^^ ^if (0 = R- Lemma 15 and Theorem 16 imply that: 

max {x {Ok) : x G P/^ n P/,} = min (/^ (T) + (O^XT)) (15) 

and the optimum can be achieved by a non-negative integer solution. Theorem 14 follows from (15) 
and the following lemma: 

Lemma 17. minrco^ {fA{T) + fB{OK\T)) > P if and only if for every cut in J\f, 

C{n)> E ^M{k)-R. (16) 
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Algorithm 1 Submatrix 



Initialize: T^"' = T^' = Gi,t = 0. 

1. Look for a submatrix Tr*"*"^^ of Gi that can be obtained by removing a row from T^*^ and has 
the property that 

rank(r,(*+i)(M^,C/)) > ^^(j) + ^ - R 

jeu kew 

for all U C [rrii] and W C [mj+i]. If such a rr*"*"^-* exists, then t t + 1 and repeat 1. 
Otherwise Ti°^ ^ T^^\t 0, and goto 2. 

2. Look for a submatrix ri*"*"^^ of Gi that can be obtained by removing a column from T^*^ and 
has the property that 

rank(Ti*+i)(T^, U)) > J^^iU) + E " ^ 

jeer feeVK 

for all U C [mj] and C [mj+i]. If such a ri*"^"^^ exists, then i i + 1 and repeat 2. 
Otherwise output Gd, = ^i*^- 



Appendix 

Proof of Theorem 7: 

Suppose that matrix Gi and flow satisfy the conditions of Theorem 4. We first prove that 
Algorithm 1 will find a solution Gd^ for flow dj, and wc later discuss the complexity of the algorithm. 
Wc begin by showing that after exiting Step 1, matrix ri°^ has exactly X^j^i^ ^j+i(i) = R rows. 
By definition, rank(Ti°^([mi+i] , [rrii])) > Ylj^iU) + Efc^i+i(^) - R = R- Suppose that has 
more than R rows. By Theorem 4, Tc^^ has a submatrix Td. which is a solution for flow dj, and 
hence Td^ has R rows. Therefore all rows that belong to ri^-* but not to Td- can be removed and 
the resulting matrix T satisfies rank{T{W, U)) > Yljeu'^iU) + YlkeW ^i+'^i^) ~ ^ ^^'^ ^ — ["^«] 
and W C [mj+i]. This contradicts the assumption that ri°^ has no more rows that can be removed. 
We can similarly argue that the matrix Gd^ output at the end of Step 2 has R columns. Since Gd^ 
is an i? X i? matrix for which rank(Gd,(H^, U)) > J2jeU^iU) + J2keW ^i+^i^) ~ R ^^^^ ^ — ['^i] 
and W C [mj+i], Gd^ is a solution for flow dj. 

To find the complexity of Algorithm 1, we argue that at Step 1 or Step 2 a removable row or 
column can be found respectively by testing at most R+ 1 rows or columns. For example, in Step 1, 
an)' row which is not part of tJ*^^ can be removed without violating any of the stated rank conditions. 
The pigeonhole principle implies that at most i? + 1 rows need to be checked to find a row which 

is not part of ri°\ Define a(0) = rank(Gi(Jl)) - Eoi(j)eanOi ^iU) + Eoi+i(fe)eanOi+i ^i+ii^) for 
every cut CI C Oi Li Oj+i . Then the conditions of Theorem 4 are equivalent to the requirement that 
a(17) > for every cut $7 C U Oj+i. Lemma 18 in the appendix proves the submodularity of 
rank(Gj(r2)). By Lemma 20, which appears later in the appendix, the function — ^o,{i)ennOi + 
Eoi+i(A;)ennOi+i ^^^o a submodular function of $7. Thus a{^l), which is the sum of two 

submodular functions, is also submodular. We next verify whether or not the minimum of a{Q) 
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is non-negative. It is known (see, e.g., [23] and [14]) that the minimum value of a submodular 
function can be found in strongly polynomial time. Here we use the algorithm by Schrijver [23] 
that finds the minimum of a submodular function / on the power set of set Ef, in time 0{\Ef\^tf), 
where tf is the time for evaluating function / for some subset of Ef. In our problem / is a which 
is defined on Ea = OiLI Oj+i. If we suppose that \Oi\ < m, then lE'al < 2m. Every evaluation of a 
requires calculating the rank of submatrix with size at most the size of Gj. Suppose that Gi 

has dimension at most hox Hq. The rank of Gi{il) can then be evaluated, for instance, by Gaussian 
elimination in O (/jq) time [4]. The complexity of minimizing q;(Q) is therefore 

Oi\Eafta) = O {m%l) . 

The number of iterations of Step 1 of Algorithm 1 is at most the number of rows, ho. Likewise there 
are at most ho iterations of Step 2. Since each iteration requires at most + 1 minimization of the 
submodular function a, the total complexity of the algorithm is at most 

{2ho) ■{R+l)-0 {m^hl) = O {Rm^h^) . 

As we run Algorithm 1 for Gi, i G [M — 1] , the complexity of this part will be O (Mi?m^^o) • 

Proof of Theorem 8: 

First we prove that the capacity C of the network A/" can be computed in poh^nomial time. For any 
cut C V, = X^^j"^ rank(Gi(0)). By Lemma 18, rank(Gj(ri)) is a submodular function of fi. 
Thus C(ri) is a submodular function of fl. Next suppose that fli and are two cuts that separate 
S from v. Then ili n and 17i U ^2 separate S from V. Therefore C(J1) is a submodular function 
over all cuts ft C V that separate S from V. To evaluate C = in.mC{fl) over all cuts that separate 
S from V we use a submodular minimization algorithm [23] with running time of 0{\Ec\^tc). Here 
Ec = V\ {S, V} is the ground set of C(fi), and |i?c| < m(M — 2), where m is the maximum number 
of nodes in each layer, tc denotes the time for the evaluation of C{Vl) = ^^7^ ^'^^^(^*(^)) ^o^' ^ 
given Q,, and it involves Af — 1 rank evaluations. Recall that each matrix Gi has size at most ho x ho. 
Then Gi{^) has size at most ho x ho. Therefore using Gaussian ehmination for rank evaluation, we 
have tc = 0(M/iq). Therefore, the total complexity of computing the capacity C is ©(m^M^/ig). 

Next we discuss the complexity of the evaluation of the vectors dj. As we discussed in the 
proof of Theorem 14, in order to determine IkU) for a fixed K, we need to solve the optimization 
problem (15) for the vector x. This is an integer programming problem over the intersection of 
two polymatroids. Let E^ = {x{l), • ■ ■ ,x{mK)} ■ Then \Ex\ < m. Let tx be the time needed for 
one evaluation of functions fA{T) and fsiT) as defined in Section 3. By applying the result of [24, 
Theorem 47.1], it follows that x can be found in time 0(|£',x.l^tx) if we use the algorithm of [23] for 
minimizing an integer programming problem over a polymatroid. Recall that 

/^(r) = min I C(Oa) - ^ h{j) + R:OKnnA = T\ . 

Define for cut in Ma with Ok n 0^ = T, 0j'(r2^) = C(O^) — Yloi(j)edA Lemmas 
19 and 20 the functions C(f2) and — J2oi{j)£n ^i(i) + ^'^^ submodular over the set of all cuts in 
Ma- Next, if for two cuts i^i and CI2 we have Ok<~^^i = T and OkC^^2 = T then De Morgan's laws 
imply that Ok n {UTnTh) = T and Ok n (ili U = T. Thus (I)t{^a) is a submodular function 
over all cuts Q.a with Ok n VLa = T. Hence the evaluation of fA(T) involves the minimization of a 
submodular function. The complexity of such a minimization over a set E^^, is 0{\E^j,\^t^j,). Since 
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the nodes in Ok n = T arc already fixed, the set E^^ is the set of all nodes in Oi U • • • U Ok-i- 
Therefore \E^^\ < m{K — 1). t^^ is the time for an evaluation of C{VLa) — '}2oi(j)eVLj^ ^i(i) + ^ fo'^ 
a given cut VLa- Since C{VLa) = X^^"^"^ rank(Gj(r2yi)), there are K — 1 rank evaluations. Each rank 
function can be evaluated in time at most 0(/io) using Gaussian elimination. Thus t^^ = 0{KhQ) 
and fA{T) can be evaluated in time at most 

0{\E^^\%^) = O {Khl ■ {m{K - l) f) = O {K^m^l,) . 

Similarly, /b(T) can be evaluated in O ((M — Kym^h^) . The total time for evaluating and /b 
is therefore O (((-M — KY + K'^)m^h^ . Therefore evaluating vector x needs time 

O (m^ ■ ((M - kY + K'')m%l) = O (((M - + K^)m^\l) 

The function above is maximized when K = and the time complexity is O (^M'^ m^^ hf^) . Since the 
vector X needs to be computed for every layer i G {2, • • • , M — 1}, we find that the total complexity 
of the second stage is O (M^m^^/io) • Thus the total complexity of constructing the transmission 
scheme is 

O (MV^^/ig) + O {MRm^D . 

Proof of Lemma 9: 

Let T*^") = Gi and for f > 1 define T^*) to be a submatrix of Gi obtained by removing an arbitrary 
row from T^*"^). Observe that for every t > 1, C/ C [mj] , and W C [mj+i] , 

rank(rW(VF, U)) > rank(r(*-^)(T^, U)) - 1. (17) 

Suppose that (2) is satisfied with strict inequality for all U C [rrii] and W C [mj+i] . Then by (17), 
for all U C [rrii] and W C [mi+i] , 

If T^^) has a tight submatrix we are done. Otherwise, let T^^^ denote the empty submatrix of Gi. 
Observe that 

rank(rW(K+i] , [rm])) = < ^Uj) + ^ d+iik) - Rd, = Rd„ 

3=1 k=l 

and so the rows of Gi cannot be removed indefinitely without violating at least one rank condition. 
Therefore by (17) there must be some t < \ such that T = T^*) satisfies (7) for all (j and W and T 
has a tight submatrix. 

Proof of Lemma 10: 

Consider a tight submatrix Gi{W, U) of Gi that is not proper. Suppose, without loss of generality, 
that Gi{W, U) is of the form P = Gi{{l} , [rrii]). Remove rows from Gi arbitrarily among the row 
blocks with indices in {2,3,- ■■ ,mj+i}, until no further rows can be removed from the resulting 
submatrix T of Gi without violating rank(T(VF, U)) > Y^j^fj f-iU) + 4+i(^) — Rd, for some 

U C [m,;] and W C [mj+i]. Notice that this process terminates before we remove all rows from 
blocks with indices in {2,3, • • • ,mi+i} for if T is a matrix with rows only from row block 1, then 
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rank(r({2, • • • , m,+i} , [mi])) = < Y:T=i ^i(^) + EZt' ^i+^i^) ' Rd, = ^i+^i^). Next 

replace Gi with T for the rest of our argument. Since no row can be removed from d among the 
row blocks with indices in {2,3, • • • ,mi+i} without violating a rank condition, there exists a tight 
submatrix Q = Gi{W', U') of Gi which has a non-empty intersection with the row blocks of Gi 
with indices in {2, 3, • • • , mj+i} . Now consider some possibilities for Q. If Q is a proper submatrix, 
then we are done. If Q is not a proper submatrix consider two cases: 

• l^'l = 1 \U'\ = rrii. Without loss of generality suppose that Q = Gi{{2} , [mi]). Since 
P and Q are tight, (2) implies rank(P) = £i+i(l) and rank(Q) = 4+i(2). Next consider the 
submatrix of Gi given by 

E= ^ =Gi {{1,2}, [mi]). 

By (2), we have rank(£') > ^j+i(l) + ii+i{2). However, by considering the rows of E,P, and 
Q, we see that, T&nk{E) < rank(P) +rank(Q), and therefore rank(£') < 4+i(l) +^i+i(2). 
Thus, rank(£') = £i+i{l) +£i+i{2). E is therefore both a tight and a proper submatrix of Gi. 

• l^'l = nT'i+i and \U'\ = 1. Without loss of generality suppose that Q = ([mj+i] , {1}). 
We will next consider a collection of subcases. Suppose first that other than P and Q there 
exists another tight submatrix K. If K is proper then there is nothing further to prove. 

If K is not proper then without loss of generality wc can assume that K is cither of the 
form K = Gi{{2},[mi]) ot K = Gi {[mi+i] , {2}) . If K = Gi{{2} ,[mi]), then the matrix 
p 1 

is proper by our argument in the previous case. Likewise, if K = Gj ([mj+i] , {2}) 

then the matrix [ Q ] is proper. Suppose next that P and Q are the only tight subma- 
trices of Gi. We already have assumed that no other row from Gi{{2, 3, • • • , mj+i} , [mi]) = 
T{{2, 3, • • • , mj+i} , [mi]) can be removed without violating some rank condition (2). Since P 
and Q are the only tight submatrices of Gi, removing one row from Gj({2, 3, • • • , mj+i} , [mi]) 
will cause the violation of a rank condition only for the tight submatrix Q = Gi ([mj+i] , {1}) . 
If a row in Gi ({2,3, • • • ,mi-^-l} , {1}) was a linear combination of some other rows in Q it 
could be removed without violating any rank condition for submatrix Q. Since Q is a tight 
submatrix and we constructed T so that no further rows could be removed from it without 
violating a rank condition, this is impossible. Hence every row in Gi{{2, 3, • • • , mj+i} , {1}) is 
independent from all other rows in Q. Thus 

rank(Q) = rank(Gi(l, 1)) + n+i(2) + • • • + ^+1(771^+1), (18) 

where rj+i(j) is the number of rows of the jth row block of Gi. By inequality (2) for U = 
[mi] and W = {j} , we have rank(Gi({j} , [rrij])) > li+i{j) for every j e {2, • • • ,mi+i}. 
Furthermore, the rank of a matrix is at most the number of rows of the matrix, and hence 
ri+i{j) > ra.nk{Gi{{j} ,[mi])). Consequently ri+i{j) > A+i(j)- This together with (18) and 
the fact that rank((5) = 4(1) imply that 

rank(G,(l, 1)) < i^{l) - 4+i(2) 4+i(m,+i) = 4(1) + 4+i(l) - Rd,. (19) 

However, by evaluating inequality (2) for U = {1} and W = {1} we obtain 

rank(Gi(l, 1)) > 4(1) + 4+i(l) - Rd^ (20) 

(19) and (20) imply that 

rank(G,(l, 1)) = 4(1) + ^i+i(l) - Rdr 
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Hence 1) is tight, and this contradicts the assumption that P and Q are the only tight 
submatrices of Gi. 



Proof of Lemma 11: 

Let p be the index of the first column block of Ga and let + 1, • • • , respectively be the indices 
of the other column blocks. Furthermore let 1, • • • ,nj+i be the indices of the row blocks of Ga- 
Select any two subsets W C [nj+i] and U C {p, + 1, • • • , rrii} . We consider two cases: 

1. If p G i7 : consider the submatrix Ga{W, U). This submatrix is the same as GiiW, U') where 
U' = {U\ {p}) U [rii] . Since Gi{W, U') satisfies the condition of inequality (2) for vector dj, we 
have: 

rank(GA(VF, U)) > ^ + ^^(^) " ^d,- (21) 

j&w keu' 

If we expand the right hand side of (21), we have: 

jew keu\{p} t=i 

= ^^,+i(j)+ ^'(^)- E ^*(*) 

jew keu\{p} t=ni+i 

= Yii+i{j)+ Y 4(A;) - (i?A - rank(P)) (22) 
jew keu\{p} 

where (22) follows by (8). The last expression is the condition of inequality (2) for Ga{W,U) 
when Ga supports fiow d^- 

2. lip^U : define V = Ul}{p} so that we can apply (22): 

rank(GA(W^, V)) > Y ^i+iU) + Y ^^(^) + rank(P) - Ra. (23) 
jew keu 

Observe that 

rank(GA(W, U)) > rank(GA(W, V)) - rank(GA(W, {p})) 

> rank(GA(W, V)) - rank(P). (24) 

By (23) and (24), 

iank{GAiW, U)) > Y ^i+^U) + Y ^^(^) " 
jew keu 

which is the rank condition (2) for Ga{W, U) when Ga supports fiow d^- 

The block matrix Ga has block dimension rij+i x (m^ — + 1). Therefore, Ga has strictly fewer 
blocks than Gi unless nj+i = mj+i and mi — ni + 1 = mj or = 1. This is impossible since P is a 
proper submatrix of Gj. It follows from our induction hypothesis matrix Ga supports fiow d^. 
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Proof of Lemma 12: 



1, • • • ,ni. Define G'^ 



Let the indices for the row and column blocks of Gb respectively be n^+i + I,-- - ,mi+i and 

p 1 

^ . Observe that matrix G'^ supports flow since the only change 

needed to the proof of Lemma 11 is to take the transposition of all matrices. Next, since is 
a solution for the flow d^, then by definition of flow the matrix Pt has full column rank and has 
rank(P) columns. Therefore 

rank(Pt) = rank(P). (25) 
Since Pc is a submatrix of P and Pt is a submatrix of Pc, we have 

rank(P) > rank(Pc) > rank(Pt), 

and therefore 



rank(Pc) = rank(P). 



(26) 



This implies that the rows that are in P but not in Pc are linear combination of the rows of Pc- It 
follows that for every U C [m] the rows that are in P([ni+i] , U) but not in Pc([ni+i] , U) are linear 
combination of the rows of Pc([rai+i] , U). Hence 



rank(Pe([ni+i] , U)) = rank(P([ni+i] , U)). 

This implies that for every W C {p, nj+i + 1, • • • , mj+i} , if p G W then Gb{W, U) = 
and 



Pc{[ni+i\,U) 
Gi{W\p,U) 



rank(GB(W^,C/)) = rank 



Pc{[ni+i],U) 
Gi{W\p,U) 



rank 



P{[ni+i],U) 
Gi{W\p, U) 



rank(Gs(W,C/)). 



llp^W, Gb{W,U) does not depend on P^ and x<mk{GB{W,U)) = xim^G'^iW^U)). Therefore in 
general for ever)' W and ?7, replacing P^ with P m Gb will not change any rank function and we 
can still use the result of Lemma 11. 



Proof of Lemma 13: 

We have to verify two properties of G^^. The first is that every row block j G [rrij+i] of G^^ 
has ^i+i(j) rows and every column block k G [rrii] has li{k) columns. The second property is that 
rank(GdJ = i2d, • By our construction, the number of columns in column blocks j G {n.j + 1,--- ,mi} 
and the number of rows in row blocks k G [n^+i] are respectively determined by GdA- Since 

dA = (rank(P),^i(ni + l),h{ni + 2), - • • , ^j(mi); £i+i(l), • • • ,^i+i(ni+i)) , 

there are the right number of rows and columns in these cases. Furthermore, the number of columns 
in column blocks j G [nj] and the number of rows in row blocks i G {nj+i + 1,--- ,mj+i} are 
respectively determined by Gd^. Since 

ds = (^j(l), • • • ,£i(ni);rank(P),4+i(ni+i + !),••• ,4+i(?^i+i)) 

there are the right number of columns and rows in these cases as well. Hence every row block 
j G [mi+i] of Gdj has 4_|_i(j) rows and every column block k G [mj] has li{k) columns. 
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For the second property, observe that since Gd^ is a solution of flow for Gb, then by the 
definition of the solution of a flow the matrix Pet has full row rank and has rank(P) rows. Therefore 

rank(Pct) = rank(P). 

Since Pet is a submatrix of Per consisting of a subset of its rows, rank(Per) > rank(Pet). Because 
Per is a submatrix of P, rank(Per) < rank(P). Thus 

rank(Pe^) = rank(P), (27) 

and so the rows that are in Per are linear combinations of the rows in Pet- Next since the rows of 
are also rows of the full-rank matrix G^g , it follows that all rows of Bt are linearly independent and 
are independent from all other rows of Gdg and consequently from all other rows in G'^^ . Therefore, 
all rows in [ Bt Lt ] arc linearly independent and are independent from all other rows in Gdj- 
Therefore the two relationships follow: 

rank(GdJ = rank ( [ P* ] ) + rank {[Per At]), (28) 
rank([ Bt Lt ]) = ^ 4+i(j). (29) 

j=ni+i+l 

Notice that by (26) and (27) rank(Pcr) = rank(Pc). Since Per is a submatrix of Pc consisting of a 
subset of its columns, the columns of Pe are linear combinations of the columns of Per- Therefore 

rank ([ Per At ]) = rank ([ Pe At]). (30) 

By (25) and (26), rank(Pe) = rank(Pt). Recall that Pt is a submatrix of Pc consisting of a subset of 
its columns. Therefore the columns of Pe are linear combinations of the columns of Pj. Hence, 

rank ([ Pe At ]) = rank ([ Pt At]). (31) 

Since Gd^ = [ -^t ^4^ ] is a solution for flow d^, it follows that rank (Gd^) = J2]=i -^i+iU)- Thus, 
(30) and (31) imply 

rank([ Per At ]) = ^£i+i{j). (32) 
Prom (28), (29), and (30) we conclude that 

rank(Gd,)= 4+i(i) = i?d,. 

Proof of Lemma 15: 
Part I: 

Here we only prove the submodularity of function fA{T) as the proof for /b(T) is similar. Consider 
layers Oi and Oj+i and transfer function Gj. Recall that Gj(ri) is the transfer function from the 
nodes in CI to the nodes in fi = {Oi U Oj+i) We first prove that: 

Lemma 18. rank{Gi{^l)) is a submodular function over cuts Q C. OiU Oj+i; i.e., for every two 
such cuts Qi and Q2 '■ 

rank{Gi {fli)) + rank{Gi (^2)) > rank{Gi {Qi n O2)) + rank{Gi {fli U ^2)) ■ 
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Q2 nO,^ 



Q 2 n O,. 




' (fi, nn2)nO,. 



5, 



(a) (^) 

Figure 2: (a) The matrices Gi{Cli) and Gi{Cl2), (b) the matrices fl ^^2) and Gi(Oi U ^2)- 



Wc point out that the preceding result was first proved in [6] in order to study the time- 
expanded representation of a network which is not layered. [6] established Lemma 18 through an 
information theoretic argument involving the submodularity of the entropy function. We offer a 
new and combinatorial proof of Lemma 18. 

Proof. Consider matrix Gi in Figure 2- (a). Suppose that we have reordered the row blocks of Gi 
and the column blocks of Gi such that the blocks corresponding to the transfer matrices of Gi{Cli) 
and Gi{^2) appear as in Figure 2-(a). We have depicted and labeled the different parts of the 
transfer matrices Gi{ili fl O2) and U O2) in Figure 2-(b). Therefore we have: 



Gi{ni n n2) 



A2 

c 
B2 



Gi{ni un2) = [Ai c ] . 



We first prove that: 



rank (Gi(Oi)) + rank (C) > rank ( [ C ] ) + rank 



A2 
G 



Let p be the maximum number of rows ai, • • • , Op in ^42 which are independent in 



ai 



C 



(33) 



and q be 



the maximum number of rows 61, • • • , 6q in Gi{Q,\) but not in [ C ] which are independent in 



"9 
Ai G 



. We have: 



rank 



A2 
G 



rank (C) + p, 



rank (Gj(ni)) = rank ( [ ^1 G ]) + q. 



(34) 
(35) 
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Let a'l, • • • ,a'p denote the rows in Gj(r2i) which respectively end in the rows ai, • • • , of A2. Then 



these rows are clearly independent in 



Ai C 



, and therefore q > p. By (34) and (35), 



A2 
C 



rank (Gi(ni)) - rank {[ Ai C ]) > rank ^ 
which implies (33). A similar argument for Gi{^l2) implies that: 

rank (Gj(J72)) + rank(C) > rank ( [ C Bi ]) + rank ( 



rank (C) , 



C 
B2 



(36) 



By adding together inequalities (33) and (36) we find that 

rank (Gi(ni)) + rank (^^(^2)) > (rank {[ Ai C ]) + rank {[ C Bi ]) - rank(C)) 



+ I rank 



A2 
C 



+ rank 



C 
B2 



rank(C) 



(37) 



If we use the submodularity of the rank function of a matrix [25] we deduce that if Wi and W2 
arc the indices of rows (columns) of some matrix and rank(VF) is the number of independent rows 
(columns) among those with indices in W , then: 

rank(Vl'^i) + rank(VF2) > rank(W^i n W2) + rank(VFi U W2). (38) 

Applying (38) to the columns of matrix Gi{Q.i U Q.2) and to the rows of matrix Gi{Q.i fl Q.2) we find 
that: 



rank([^i C])+rank([C 5i ] ) > rank(C) + rank (G^ (Jli U Jla)) 



rank 



A2 
C 



+ rank 



C 
B2 



> rank(C) + rank {d (fti n ^2)) . 



(39) 
(40) 

□ 



Lemma 18 follows from (37), (39), and (40). 

Next we extend Lemma 18 to a multilayer network Ma '■ 
Lemma 19. In the network Ma, C(i7) is a submodular function over cuts C Oi U ■ ■ ■ U Ox- 
Proof. We decompose $7 into the subsets O = Oi U O2 U • • • U ^k-i, where = Q n {Oi U Oj+i) 
defines a cut of the subnetwork of Ma with set of nodes U Cj+i. We have 

K-l K-l 

C{n) = ^ rank(G,(J7)) = ^ rank (G,(J7,)) . 

i=i i=i 

If two cuts O and O' are respectively decomposed into [Qi] and [O^], then Qnfl' and O U $7' will be 
respectively decomposed into [O^ n O'J and [Oj U O^]. By Lemma 18, rank {Gj{Qj)) is submodular. 
Therefore 

C{n) + C{n') = (rank {Gj{nj)) + rank {Gj{n'j))) 

K-l 

> (rank {Gj{nj n n'^)) + rank {Gjifij u n'j))) = c(% n n'j) + c{nj u n'j). 

.7 = 1 

Since the sum of submodular functions is submodular, the final result follows. □ 
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We next prove the following useful lemma: 

Lemma 20. For any function f defined on the set E and any given set V C. E, the function g on 
the power set of E defined as g{U) = YlieUnv /(^) submodular. 



Proof. By definition, 



g{U,)+g{U2)= Yl /(^)+ E /(^) 
k&Uinv k£U2r\V 

E /(^)+ E /(^) 

ke{Ul(^v)^^{U2r\V) ifc6({/iny)u({/2ny) 

= E /w+ E 

fce((7in!72)ny fee(!7iu!72)ny 
=g{Uir\U2) + g{UiUU2). 



□ 



We next prove that is submodular. 
Proof. Suppose Ti,T2 C Ok-, and let Ai and A2 be the two cuts in Ha such that 

/A(ri) = c(Ai) - + Oi^nli = ri 

fA{T2) = c{A2)- + Oi^nl2 = r2 

Consider Ai n and ^1 U A2. By De Morgan's laws, Ok n (Ai n ^2) =Ok^ U A2) = Ti U T2 
and Ok n (Ai U A2) =OKf^ (^1 n ^2) = Ti n T2. By (11) we have 

fA{TiVJT2)<C{Air\A2)- Y + (41) 

fA{TinT2)<C{AiUA2)- Y + (42) 

Oi{j)eAiLlA2 

By the submodularity of C{Cl) we have 

n A2) + C(Ai U ^2) < C{Ai) + C(A2). (43) 

Furthermore 

E ^i(^')+ E ^i(i)= E ^i(j')+ E ^i(^')- (44) 

Oi{j)eAinA2 Oi(j)eAiuA2 Oi(j)eAi Oi(j)eA2 

By (41)-(44), we have 

/a(Ti n T2) + /a(Ti U r2) < fA{Ti) + fA{T2). 

□ 
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Part II: 

We prove the result for /a and it is straightforward to modify it for /b- It suffices to prove that for 
every TCOr and every Oxii) i T, 

fA{T)<fA{Tu{OK{i)}). 
Suppose that for T and Oxii) ^ T, cut A in Ma achieves Ja {T U {OK{i)}) ■ We have 

fA{TVJ{OK{i)}) = C{A)- Hj) + R, OKnA = TU{OK(i)}- (45) 

Next consider the cut A' = AU {OK{i)} ■ Notice that A'nOK = T. Therefore 

C{A')~ Yl ^iij) + R>fA{T). (46) 

Oi{j)&A' 

Observe that for every j < K — 2, Gj{A) = Gj{A'). Furthermore Gk-i{A) has the same row blocks 
as Gk-i{A') and an additional row block corresponding to the transfer function from the nodes 
in ^ n Ok-1 to {OK{i)} . Therefore C{A) > C{A'). Finally, EoiO)eA^i(i) = EoiO)6A' and 
hence 

C{A)- J2 hij) + R>C{A')- Y m + R- (47) 

Oi{3)eA Oi(i)6A' 
(45), (46), and (47) imply that /^(T) <fA{TU {Oi^(i)}) . 

Part III: 

Since I]cii(j)Gf7^ ^i(j) < R and J2oM{k)&nB ^^^^^ - ^"^^ non-negative functions. 

Observe that by choosing cut ^ = Oi U • • • U Ok for network Ma and cut 5 = for network Mb, 
we find that An Ok = ^, B (1 Ok = ^, and 

/a(0)=C(A)- J2 Hj) + R = fB{%) = C{B)- Y iMij) + R = 0. 
Oi{j)eA OM{j)eB 

Proof of Lemma 17: 

Suppose that To achieves minTcOj^ {fA(T) + fB{OK\T)) and cuts ^Ia and ^b respectively achieve 
fA{To) and fB{OK\To) in Ma and Mb- Then 

fA{To)=C{nA)- Yl ^i(i) + ^' OKnnA = To (48) 

Oi{j)<=nA 

fB{OK\To)=c{^B)- Y ^mU) + R, OKnnB = OK\To. (49) 

Let ri = ^Ia U Ob be a cut in network M. Since Ok n Q,a = Ok n Ob = Ok\To, it follows that 
Gi{n) = Gi{nA) for i e {1, • • • , if - 1} and Gi{n) = Gi(OB) ior i e {K,- ■ ■ ,M - 1} . Thus we 
have 

C(0) =C(0a) + C(0s). 
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Prom (48) and (49) we have 

fAiTo) + fBiOK\To)=Cin)- Yl ^lO')- E ^Mij) + 2R. (50) 

Oi{j)enA OMimCiB 

Since EoiOOen^ ^lO') = J2o^(j)en^iU) and T.o,{j)enB ^^U) = T.OM{j)eCi^M{j), (50) and (16) 
together imply /a(7o) + fB{OK\To) > R, as desired. 

To prove the converse, consider a cut J7 in network J\f and partition it into two cuts Qa and Qb 
in networks Ma and Mb respectively. Let Tq = ClD Ok- Then by definition 

fA{To)<c{nA)- Yl + OKnnA = To 
fB{OK\To)<c{nB)- ^M{j) + R, OKnnB = OK\To. 

Therefore 

< min {fA{T) + fB(OK\T)) < fA{n) + /^(OiATo) 

TCOk 

<cinA) + c{nB)- Y E ^Mij) + 2R. 

If we substitute C(nA)+C(nB) = C{Q) and Eoi(i)eQ^ ^lO') = EoiO)en^iO') and EoMOOefis ^^(^') = 
Eoj(^(j)gQ ^M(j) into the preceding expression, we obtain 

c{^)> Y Y ^M{j)-R, 

Oi{j)en Oi{j)en 

which is the final result. 
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